METHOD AND APPARATUS FOR DATA STORAGE AND RETRIEVAL



FIELD OF THE INVENTION 



The present invention relates to data storage and retrieval systems. 
More particularly, embodiments of the present invention relate to data storage
and retrieval systems for records management. 



BACKGROUND OF THE INVENTION 



For years, pundits have predicted the coming of the paperless office:
an office environment in which there is no longer a need for expensive,
messy, and disorganized paper files and records. This Utopian ideal has
never materialized. Instead, businesses may be even more reliant on paper
files and archives than in the past. This is particularly true in businesses
which have multiple offices and which generate a large number of original
documents such as signed contracts, compliance documents, or the like. In a
business that generates a number of contracts, for example, it can be quite
difficult to file and store the original document (or a facsimile thereof) in a
manner allowing it to be readily located and retrieved days, if not years, later.


A typical records management system may consist of a paper filing 
system which stores each document in a logical manner, e.g., contracts may 
be filed by date, by contracting party, or by subject matter. The paper filing 
system may be supplemented with one or more databases which are used to 
provide further information about each document and to allow electronic
searching for information entered in the database. 

Such approaches, however, do not allow businesses to efficiently and 
accurately store, and subsequently retrieve and identify, all relevant 
documents. Existing systems are particularly unsuited for use in businesses
having multiple offices or locations which desire to store and retrieve
documents from each location. Often, each location of a business
implements its own records management system. Each location often uses
its own data storage and retrieval conventions, making it difficult (if not
impossible) to share document information between locations.

It would be desirable to provide a system and method which may be 
utilized to store, catalog, and retrieve documents from multiple locations of a 
business. It would further be desirable to provide a system and method which
allows integration with existing document management databases and 
document generation and management systems. It would further be desirable 
to provide a system and method which allows the storage, retrieval and 
control of document images along with paper copies of documents. 


SUMMARY OF THE INVENTION 

Embodiments of the present invention provide a system, method, 
apparatus, and computer program code for data storage and retrieval.
According to some embodiments, data storage pursuant to the present
invention includes receiving document information associated with a
document to be stored. A pending record containing the document
information is generated. The document information is verified, and an active
record is generated if the verifying is successful. In some embodiments, at
least part of the document information is received from a document 
generation system. In some embodiments, the document information is 
received from a user providing data identified by a template, where the 
template is identified based on a type of the document to be stored. 


In some embodiments, document retrieval pursuant to the present 
invention includes receiving information associated with a desired document. 
A record of a document management database containing the information is
identified. A physical location of the document is identified based on the 
record. A location of an image of the document is identified based on the 
record as well. Other information from the record is displayed. 

With these and other advantages and features of the invention that will 
become hereinafter apparent, the nature of the invention may be more clearly 
understood by reference to the following detailed description of the invention, 
the appended claims and to the several drawings attached herein. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a block diagram overview of a data storage and retrieval 
system according to an embodiment of the present invention; 

FIG. 2 is a flow chart of a method according to some embodiments of 
the present invention; 

FIG. 3 is a block diagram overview of a data storage and retrieval 
system according to some embodiments of the present invention; 

FIG. 4 is a block diagram of a data storage and retrieval system 
controller according to some embodiments of the present invention; 

FIG. 5 is a tabular representation of a portion of a template database
according to an embodiment of the present invention;

FIG. 6 is a tabular representation of a portion of a document 
management database according to an embodiment of the present invention; 

FIG. 7 is a flow chart of a method for storing information associated
with a data storage and retrieval system according to some embodiments of 
the present invention; 

FIG. 8 is a flow chart of a method for generating pending records from
external systems in communication with a data storage and retrieval system
according to some embodiments of the present invention; and 

FIG. 9 is an illustration of a user interface associated with a data 
storage and retrieval system according to some embodiments of the present
invention. 

DETAILED DESCRIPTION 

Applicants have recognized that there is a need for a system, method,
apparatus, and computer program code for data storage and retrieval which
overcome drawbacks of existing systems and approaches. Features of
embodiments of the present invention will now be described with a system
overview, a description of an embodiment of a system architecture, and
processes pursuant to some embodiments of the present invention.

System Overview 

Reference is now made to the drawings, beginning at FIG. 1, where a
block diagram of a data storage and retrieval system 100 according to an
embodiment of the present invention is shown. As depicted in FIG. 1, data
storage and retrieval system 100 includes a controller 400 in communication
with a client device 10. For example, a user may input information associated
with a document via client device 10. Client device 10 may then transmit
appropriate information to controller 400, which in turn may store or retrieve
document information based on the information received from client device
10.

For example, a user operating client device 10 may interact with
controller 400 to input new document information to create a new record of
information about the document (and, in some embodiments, to associate the
record with one or more document images). The user may also operate client
device 10 to interact with controller 400 to search for, locate, and retrieve
document information stored in the system (including, in some embodiments,
one or more document images). Further, the user may also interact with the
system to retrieve document images or to locate and/or retrieve physical
documents which have been stored using embodiments of the present
invention.

FIG. 2 is a flow chart of a method according to some embodiments of 
the present invention. The flow chart in FIG. 2 and the flow charts in other 
figures described herein do not imply a fixed order to the steps, and 
embodiments of the present invention can be practiced in any order that is
practicable. The method shown in FIG. 2 may be performed, for example, by 
the data storage and retrieval system controller 400. 

Process 200 begins at 202 where document information is entered and
verified. Processing at 202 may be performed by one or more users
operating client devices 10 and interacting with controller 400. For example, a
user in a company's legal department may wish to input information about a
new contract entered into by the company. In the example, the user may
enter information about the contract, including information identifying a
particular classification or collection into which the contract is to be classified,
the name of the contract, various categories in which the contract should be
categorized, and other information particularly identifying the contract so that
it may be accurately classified, categorized and indexed for ready retrieval in
the future.


Other information identifying the origin, owner, location, and archival 
information associated with the contract may also be entered. This document 
information is utilized, in some embodiments, to generate a pending record in
a database accessible by data storage and retrieval system controller 400. In
some embodiments, a data verification step is also performed (e.g., in
conjunction with or subsequent to processing at 202) before an active record
is generated for the document. In some embodiments, this data verification is
performed by one or more records administrators operating client devices 10 
and interacting with a pending record database accessible by data storage 
and retrieval system controller 400. 
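
The pending-to-active flow described above can be illustrated with a minimal Python sketch. It is not the implementation disclosed here; the names `DocumentRecord`, `RecordStatus`, `REQUIRED_FIELDS`, `verify` and `activate_if_verified` are hypothetical, and the verification rule (required fields must be non-empty) is an assumption standing in for whatever checks a records administrator would actually perform.

```python
from dataclasses import dataclass
from enum import Enum


class RecordStatus(Enum):
    PENDING = "pending"
    ACTIVE = "active"


@dataclass
class DocumentRecord:
    record_id: str
    info: dict
    status: RecordStatus = RecordStatus.PENDING  # new records start as pending


# Assumed verification rule: these fields must be present and non-empty.
REQUIRED_FIELDS = ("collection_name", "file_name")


def verify(record: DocumentRecord) -> bool:
    """Return True when every required field is present and non-empty."""
    return all(record.info.get(name) for name in REQUIRED_FIELDS)


def activate_if_verified(record: DocumentRecord) -> DocumentRecord:
    """Promote a pending record to active only if verification succeeds."""
    if record.status is RecordStatus.PENDING and verify(record):
        record.status = RecordStatus.ACTIVE
    return record
```

A record that fails verification simply remains pending, so it can be corrected and checked again later.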

Processing continues at 204 where the document information entered
at 202 is linked or otherwise associated with one or more document images
and/or one or more hard copies of the document. Processing at 204 may,
for example, be performed under the control of a user operating client device
10 and interacting with controller 400. Further, in some embodiments where
document images are linked at 204, processing may also involve interaction
with one or more scanning or imaging devices which are operated to generate
document images for association with the document information entered at
202.

According to some embodiments of the present invention, document
images may be easily and accurately linked to document information, allowing
ready access, indexing and retrieval of document information and images. In
some embodiments, processing at 204 includes identifying a physical location
of one or more hard copies of the document. This is particularly useful, for
example, in environments where an entity has multiple offices which maintain
documents. In such an example, processing at 204 may include identifying a
physical location (e.g., the particular office and even the particular shelf)
where a document is located.

Processing continues at 206 where users operating client devices 10
may interact with controller 400 to access/retrieve document information,
images and/or hard copies. Embodiments of the present invention collect,
verify and store information in a manner which allows ready identification,
location and retrieval of document information, document images and copies
of documents, even in environments where documents are generated and
stored in multiple geographically diverse locations.


System Architecture 

FIG. 3 is a block diagram overview of a data storage and retrieval 
system 300 according to another embodiment of the present invention. As in 
FIG. 1, controller 400 is in communication with client device 10. Further,
controller 400 is in communication with satellite systems such as one or more 
document generation systems 30, one or more imaging systems 40 and one 
or more archival systems 50. 

As used herein, devices (such as controller 400, client device 10,
document generation system 30, imaging system 40, and archival system 50)
may communicate via a communication network 20, such as a Local Area
Network (LAN), a Metropolitan Area Network (MAN), a Wide Area Network
(WAN), a proprietary network, a Public Switched Telephone Network (PSTN),
a Wireless Application Protocol (WAP) network, a wireless LAN (e.g., in
accordance with the Institute of Electrical and Electronics Engineers 802.11
standard), a Bluetooth network, an Infrared Radiation (IR) network, and/or an
IP network such as the Internet, an intranet or an extranet. As used herein,
the term "communications" can refer to wired and/or wireless communications
as appropriate. Note that the devices shown in FIG. 3 need not be in constant
communication. For example, controller 400 may communicate with a client
device 10 on an as-needed or periodic basis.

Although a single controller 400 is shown in FIG. 3, any number of 
controllers 400 may be included in the data storage and retrieval system 300.
Similarly, any number of client devices 10, or any other device described
herein, may be included in the data storage and retrieval system 300
according to embodiments of the present invention. 

Controller 400, client devices 10, and satellite devices 30, 40 and 50 
may be any devices capable of performing the various functions described
herein. Client device 10 may be, for example: a Personal Computer (PC), a
portable computing device (e.g., a laptop computer), a Personal Digital
Assistant (PDA), or a dedicated data storage and retrieval system 300
terminal. Note that the client device 10 may be associated with a full-blown
workstation application or a thin-client browser-based application. In one
example environment, a business may utilize features of embodiments of the
present invention over a corporate intranet, allowing access to individual
employees operating personal computers configured as client devices 10. In
this manner, a large number of users may access and store document
information using the system.

According to some embodiments, client device 10 may, for example, 
control user functionality (e.g., by supporting applicable user interactions). 
Client device 10 may also perform session management (e.g., by providing
user login and logout capability, managing a physical connection including a
connection status notification to a user, and issuing a logout when
appropriate). In some embodiments, client device 10 may be operated as a
system administrator device enjoying greater system privileges than a
standard user device. Those skilled in the art will recognize that a variety of
different access and control privileges may be granted to different users
accessing document information via system 300. 

According to some embodiments, a user enters information associated 
with a document to be stored or retrieved via the client device 10. In 
embodiments where a user wishes to store document information, the user
may be prompted to provide particular types and content of information to fully
describe the document for future retrieval. In embodiments where a user
wishes to retrieve a document, the user may be prompted to enter search
information or other data inputs to enable retrieval of stored document 
information. 

Information provided by the user is transmitted from client device 10 to
controller 400 via communication network 20, and controller 400 may process
the information to facilitate the storage or retrieval of document information.
According to some embodiments, controller 400 is also in communication with
one or more satellite systems, such as a document generation system 30, an
imaging system 40, and an archive system 50, each of which may play a role
in the storage and/or retrieval of document information. For example, in some 
embodiments, document information used to populate a document record 
may be automatically retrieved from information originally created by one or 
more document generation systems 30. 


As a specific example, a legal department of a company may utilize an 
automated or partially automated system to generate contracts. According to 
embodiments of the present invention, this contract generation software 
interfaces with data storage and retrieval system 300 to generate a pending
document record for each new contract generated by the contract generation
software. This ensures that new documents created within the company are
accurately and easily entered into system 300. Further, according to some
embodiments, the record generated is a pending document record which may
be completed or converted into an active document record after the contract
has been completed (e.g., after the contract has been signed). Embodiments
of the present invention may interface with any of a number of different 
document generation systems 30 to facilitate creation of new document 
records. 
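
As a hedged illustration of the interface just described, the following Python sketch maps data exported by a contract generation system onto a pending document record and later converts that record to active once the contract is signed. The dictionary keys (`counterparty`, `author`, `created`, `signed_on`) and both function names are assumptions made for this example; the actual data exchanged with a document generation system 30 is not specified here.

```python
def pending_record_from_contract(contract: dict) -> dict:
    """Build a pending record from data exported by a contract generation system."""
    return {
        "collection_name": "CONTRACTS",
        "file_name": contract["counterparty"],  # assumed naming convention
        "creation_info": {
            "author": contract.get("author"),
            "created": contract.get("created"),
        },
        "status": "pending",
    }


def mark_signed(record: dict, signed_on: str) -> dict:
    """Return a copy of the record converted to active after signing."""
    updated = dict(record)  # shallow copy keeps the pending record untouched
    updated["status"] = "active"
    updated["signed_on"] = signed_on
    return updated
```

Returning a copy rather than mutating the pending record is a design choice of the sketch only; it keeps the original pending entry available for audit.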

One or more imaging systems 40 may also be in communication with
controller 400 via communication network 20. In some embodiments, a
number of different imaging systems 40 may be provided, each distributed
geographically to support the imaging needs of different locations of a
business operating system 300. Any of a number of different imaging
systems 40 may be used in conjunction with embodiments of the present
invention. In one currently-preferred embodiment, imaging systems which
allow the fast and accurate generation of Adobe Acrobat® portable document
format (PDF) files are used. In some embodiments, software such as the
Ascent Capture® software provided by Kofax® may be used to organize and 
permit distributed capture and manipulation of document images. In some 
embodiments, imaging systems 40 may also be provided with optical 
character recognition software which facilitates the conversion of document 
images into text files. According to some embodiments of the present 
invention, some or all of the imaging systems 40 in communication with 
controller 400 may be operated by third party service providers. 

Archive system 50 may be one or more systems operated to control 
and track document archive information. For example, archive system 50 
may be a system operated by, or on behalf of, an archival service provider 
such as Iron Mountain, Inc. of Boston, Massachusetts. For example, archive 
system 50 may be used to maintain and update document archive information 
(such as storage box information, archive status, etc.) relating to documents 
which have been sent to an archival service for storage (and perhaps for later 
destruction). Embodiments of the present invention permit the data storage 
and retrieval system of the present invention to keep track of documents 
which have been forwarded to an archival service for custody. According to 
some embodiments, a user operating client device 10 may interact with the 
data storage and retrieval system 300 to request the return of a particular 
document from its storage location at an archival service. Further, an 
administrative user may operate client device 10 to input archival information 
regarding a particular document into a document record stored by controller 
400. Other information may also be shared between controller 400, client 
device 10, and archival system 50.

Note that controller 400 may communicate with client device 10,
document generation system 30, imaging system 40, and archive system 50 
via a single communication network 20 or via different communication 
networks. 


Referring now to FIG. 4, a more detailed view of data storage and 
retrieval system controller 400 is shown that is descriptive of the devices 
shown, for example, in FIGS. 1 and 3 according to some embodiments of the 
present invention. Controller 400 comprises a processor 410, such as one or 
more INTEL® Pentium® processors, coupled to a communication device 420
configured to communicate via a communication network (not shown in FIG. 
4). Communication device 420 may be used to communicate, for example, 
with one or more client devices 10 and/or satellite devices (such as systems 
30, 40 and 50). 


Processor 410 is also in communication with an input device 440. 
Input device 440 may comprise, for example, a keyboard, a mouse or other 
pointing device, a microphone, knob or a switch, an IR port, a docking station, 
and/or a touch screen. Input device 440 may be used, for example, to enter 
information (e.g., information identifying a document to be stored or retrieved).

Processor 410 is also in communication with an output device 450. 
Output device 450 may comprise, for example, a display (e.g., a display 
screen), a speaker, and/or a printer. Output device 450 may be used, for 
example, to output information about a document to be stored or retrieved
from the data storage and retrieval system. 



Processor 410 is also in communication with a storage device 430. 
Storage device 430 may comprise any appropriate information storage 
device, including combinations of magnetic storage devices (e.g., magnetic
tape and hard disk drives), optical storage devices, and/or semiconductor
memory devices such as Random Access Memory (RAM) devices and Read
Only Memory (ROM) devices.

Attorney Docket No.: G08.004
Express Mail Label No.: ET029934429US

Storage device 430 stores a program 415 for controlling processor 410.
Processor 410 performs instructions of program 415, and thereby operates in
accordance with the present invention. For example, processor 410 may
receive document information, identify document templates, present
document templates to users entering document information, search for and
identify existing document information, search for and identify document
images, etc.

Storage device 430 also stores databases, including a template 
database 500, and a document management database 600. These 
databases are described in detail below and depicted with exemplary entries
in the accompanying figures. As will be understood by those skilled in the art,
the schematic illustrations and accompanying descriptions of the databases
presented herein are exemplary arrangements for stored representations of
information. A number of other arrangements may be employed besides
those suggested by the tables shown. Similarly, the illustrated entries of the
databases represent exemplary information only; those skilled in the art will
understand that the number and content of the entries can be different from 
those illustrated herein. 

Databases 


Referring now to FIG. 5, a table represents a template database 500 
that may be stored at (or accessible by) controller 400. The table includes 
entries identifying a number of different templates which are available to 
prompt and otherwise control the input of document information by users 
operating client devices 10. The table also defines fields 502-506 for each of
the entries. The fields specify: a template identifier 502, a collection name
504, and a file name 506. The information in template database 500 may be
created and updated, for example, based on information received from a
system administrator operating client device 10. For example, a system 
administrator, librarian, or other authorized user may generate, modify and 
store template information as needed. 


Template identifier 502 may be, for example, an alphanumeric code 
associated with a particular document template which has been created for 
use in conjunction with system 300. Template identifier 502 may be 
generated by, for example, controller 400. 


Collection name 504 may be, for example, information identifying one 
or more document collections with which the template identified by template 
identifier 502 is to be used. For example, in a system used to store and 
retrieve legal documentation, a variety of different collections may be 

15 established and utilized to accurately store and retrieve legal documents in a 
logical manner. As an illustrative example, documents may be categorized 
into collections such as: "AFFILIATES AND SUBSIDIARIES"; "AFFILIATES 
AND SUBSIDIARIES (Topical)"; "CONTRACTS"; "CONTRACTS - 
CONFIDENTIALITY AGREEMENTS"; "CONTRACTS - SOFTWARE 

20 AGREEMENTS"; "CONTRACTS -EMPLOYMENT AGREEMENTS"; 

"ATTACHMENTS, GARNISHMENTS, and LIENS", etc. Some or all of these 
collections may have specific data collection requirements which may be 
specified using one or more templates. As a result, users inputting new data 
into system 300 may be directed to enter data in a repeatable manner, 

25 allowing documents to be easily classified and retrieved. Other types of 
collections may also be used, such as, for example, "Contracts-Vendor", 
"Contracts - Soft Dollar Agreements" or any other document classification 
which may be used to sort documents. 

File name 506 may be, for example, information identifying a file
location of the template identified by template identifier 502. The template file
may be in any of a number of different formats and the name may indicate a
location on a network or in data storage device 430 and accessible to
controller 400. According to some embodiments, when a user indicates a
desire to input information about a new document, the user is asked to select
a classification of the document. The classification selected may be used to
retrieve the appropriate template file from template database 500. The
template file may be presented as, for example, a form on client device 10 
which prompts the user to input particular types of information as defined by 
the particular template. 

In some embodiments, a default template may also be defined which
may be presented to direct user input in cases where no other template
identifier is appropriate or identifiable for particular document information to be 
entered by a user. 
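
The template selection described above, including the fallback to a default template, might be sketched in Python as follows. The rows, identifiers and file names are invented for illustration and are not the entries of FIG. 5; `find_template` is a hypothetical helper.

```python
# Hypothetical rows mirroring fields 502-506:
# template identifier, collection name, file name.
TEMPLATE_DATABASE = [
    {"template_id": "T1001", "collection_name": "CONTRACTS",
     "file_name": "contracts.tpl"},
    {"template_id": "T1002", "collection_name": "AFFILIATES AND SUBSIDIARIES",
     "file_name": "affiliates.tpl"},
]

# Assumed default template, used when no collection-specific template exists.
DEFAULT_TEMPLATE = {"template_id": "T0000", "collection_name": None,
                    "file_name": "default.tpl"}


def find_template(collection_name: str) -> dict:
    """Return the template for a classification, or the default template."""
    for row in TEMPLATE_DATABASE:
        if row["collection_name"] == collection_name:
            return row
    return DEFAULT_TEMPLATE
```

The returned `file_name` would then be used to load the form that prompts the user for the fields the template defines.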

Referring now to FIG. 6, a table represents a document management
database 600 that may be stored at (or accessible by) controller 400. The
table includes entries identifying a number of different documents which have
been entered into data storage and retrieval system 300. In some
embodiments, data storage and retrieval system 300 may have separate
databases for pending document entries and for active, or approved,
document entries. In some embodiments, a single database may be used to
store both types of entries, but pending records may be flagged or otherwise
indicated as not having been finally approved for storage in the system.
Document management database 600 includes a number of fields 602-620 for
each of the entries. The fields specify: a record identifier 602, a collection
name 604, a file name 606, a primary subcategory 608, a secondary 
subcategory 610, a tertiary subcategory 612, location information 614, 
creation information 616, a media type 618 and archive information 620. 
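
One possible in-memory shape for an entry with fields 602-620 is sketched below in Python. The class name `DocumentEntry`, the attribute names, the chosen types, and the `pending` flag (modelling the single-database variant in which pending records are flagged) are all assumptions for illustration, not a schema given by this description.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class DocumentEntry:
    record_id: str                                     # record identifier 602
    collection_name: str                               # collection name 604
    file_name: str                                     # file name 606
    primary_subcategory: Optional[str] = None          # 608
    secondary_subcategory: Optional[str] = None        # 610
    tertiary_subcategory: Optional[str] = None         # 612
    location_info: dict = field(default_factory=dict)  # location information 614
    creation_info: dict = field(default_factory=dict)  # creation information 616
    media_types: list = field(default_factory=list)    # media type 618
    archive_info: dict = field(default_factory=dict)   # archive information 620
    pending: bool = True  # assumed flag: not yet approved for storage


def active_entries(entries):
    """Return only the entries that have been approved (not pending)."""
    return [entry for entry in entries if not entry.pending]
```

With separate pending and active databases, the flag would be unnecessary and the two kinds of entries would simply live in different stores.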

The information in document management database 600 may be
created and updated, for example, based on information received from a user
operating client device 10. For example, a user may interact with controller
400 to enter document information into document management database 600.
In some embodiments, the nature and extent of the data stored in document
management database 600 may vary based on the type of document about
which information is being entered. In some embodiments, the nature and
extent of data stored may be governed by a template selected from template
database 500 (FIG. 5) based on a classification of the document about which
information is to be entered. In some embodiments, records and fields of
information in document management database 600 may be populated using
data generated by, and forwarded from, one or more satellite systems, such
as document generation system 30 (FIG. 3).

Record identifier 602 may be, for example, an alphanumeric code 
associated with a particular document for which information is stored. 
Record identifier 602 may be generated by, for example, controller 400 (e.g., 
new record identifiers may be assigned sequentially as new records are
established). 
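
Sequential assignment of alphanumeric record identifiers could look like the following Python sketch. The `DOC` prefix, the six-digit zero padding and the class name `RecordIdGenerator` are assumptions chosen for the example; no identifier format is prescribed above.

```python
import itertools


class RecordIdGenerator:
    """Hands out sequential alphanumeric record identifiers."""

    def __init__(self, prefix: str = "DOC", start: int = 1, width: int = 6):
        self._prefix = prefix
        self._width = width
        self._counter = itertools.count(start)  # yields start, start + 1, ...

    def next_id(self) -> str:
        """Return the next identifier, zero-padded to the configured width."""
        return f"{self._prefix}{next(self._counter):0{self._width}d}"
```

In a multi-user deployment the counter would need to be backed by the database (for example a sequence) rather than held in process memory; the sketch ignores concurrency.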

Collection name 604 includes information identifying a particular
collection with which the document is associated. A number of different types of
collections may be assigned and established to group and sort different types
of documents. For example, in a company which generates a large number of
legal and compliance documents, different collections may be established for
different types of legal documents as well as different types of compliance
documents. Applicants have found that such sorting by collection allows more
efficient identification and retrieval of documents.

File name 606 may include, for example, information identifying a 
name of the document stored using the system of the present invention. 
Different naming conventions may be provided based on the type of 
document. For example, for contracts generated by the legal department, the
name of the contract may be the name of the entity with which the business
has contracted or the name by which the contract is referred to. Other
naming conventions may also be used to easily identify and locate information
after it has been stored. 

Primary, secondary, and tertiary subcategories 608-612 may include
information used to categorize and locate each document. The particular type
of data stored for each category may depend, in part, on the collection with
which the document is associated. For example, a document in the
"CONTRACTS" collection may require that the contracting party name be
included in the file name field 606, that the nature of the agreement be stored
in the primary subcategory 608, and that further descriptive information about the
agreement be stored in the secondary and tertiary subcategories 610, 612. 
Other types of categorization information may also be provided to facilitate 
ready identification and retrieval of documents stored in document 
management database 600. 


Location information 614 may be, for example, information specified by 
a user operating client device 10 which specifies a particular location of the 
document associated with the record identified by record identifier 602. For 
example, location information 614 may include information specifying a
business region, a department, an origin of the document, and the actual
physical location of the document. This information may be used to help
locate paper copies of documents stored and indexed using embodiments of
the present invention. In some embodiments, where paper copies of the
document have been sent to an archival unit, information may be provided
particularly identifying the archival location so that the document could be
readily retrieved from the archival location. 

In some embodiments of the present invention, archival instructions for 
a particular document identified by a particular document record in database 
30 600 may be forwarded to the appropriate archive system 50. For example, if 
an original document is stored by an archival service provider in their upstate 
New York warehouse, a user operating client device 10 may send the archival 
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service provider explicit instructions about retrieving the document (including, 
for example, information identifying the location and the storage box number) 
so that the original document can be quickly and accurately delivered to the 
requesting user. In some embodiments, communication between system 300 
and the archival service provider may be via electronic mail, the Internet,
telephone, or the like. 
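
By way of illustration only, such retrieval instructions might be assembled from the location information held in the document record. The following Python sketch is an assumption made for explanatory purposes; the function name, the dictionary keys, and the message layout are hypothetical and are not prescribed by the embodiments described herein.

```python
def build_retrieval_request(record_identifier: str, location: dict, requester: str) -> str:
    """Compose a plain-text retrieval request for an archival service provider.

    The keys 'archive_location' and 'box_number' are hypothetical names for
    the location and storage box number mentioned in the description.
    """
    lines = [
        f"Retrieval request for record {record_identifier}",
        f"Archive location: {location['archive_location']}",
        f"Storage box number: {location['box_number']}",
        f"Deliver to: {requester}",
    ]
    return "\n".join(lines)


# Illustrative values only.
request = build_retrieval_request(
    "R-000123",
    {"archive_location": "Upstate New York warehouse", "box_number": "B-17"},
    "J. Smith, Legal Department",
)
```

The resulting text could then be sent by electronic mail or any of the other communication channels noted above.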

Creation information 616 may be, for example, information identifying 
the entity and/or individual which created the document associated with 
record identifier 602. For example, information may be provided identifying
the author of the document, the date the document was created, the date the
record was opened, and the status of the record (e.g., active, archived, or
closed). This information may be used to track status of the document and the
record as well as to identify the origination of the document. 

Media type 618 may be, for example, information identifying the types 
of media in which the document exists. For example, a document classified 
and stored using embodiments of the present invention may be stored as an 
image document (e.g., in PDF format or some other image format), a
hard-copy document, or a combination of the two. In some embodiments, a text
version of the document may also be stored (e.g., in a word processing
format, ASCII text, or the like). In some embodiments, a pointer to the location
of the image or text version of the file may also be provided allowing ready 
access to the file. In some embodiments, media type 618 may be populated
with information after a document record has been created, and even after the
document record has been approved and transformed into an active record. 
For example, a document image may be generated some time after the 
original document record was created. According to embodiments of the 
present invention, the document image may be easily associated with the
document record by referencing the record identifier 602 associated with the
document record. In some embodiments, the document image may be 
named using the record identifier 602 as part of the document name, allowing
ready association between the image and the record. 
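
One way such a naming convention might work is sketched below in Python. The file-name pattern (record identifier, a "_p" separator, and a zero-padded page number) is purely an assumption chosen for illustration; the description does not specify any particular pattern, and the function names are hypothetical.

```python
from pathlib import PurePosixPath


def image_name_for(record_identifier: str, page: int = 1, ext: str = "pdf") -> str:
    """Build an image file name that embeds the record identifier 602."""
    return f"{record_identifier}_p{page:03d}.{ext}"


def record_identifier_from(image_name: str) -> str:
    """Recover the record identifier from a name built by image_name_for."""
    stem = PurePosixPath(image_name).stem
    # Split on the last "_p" so identifiers that contain "_p" still round-trip.
    return stem.rsplit("_p", 1)[0]
```

Because the identifier can be recovered from the name alone, an image generated long after the record was created can still be matched to its record without a separate lookup table.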

Archive information 620 may be, for example, information identifying 
archive status or location information for the document associated with record
identifier 602. Archive information 620 may include, for example, information 
about the retention of a particular document (e.g., indicating a period of time 
for which the document will be maintained), a planned destroy date, a 
required destruction authorization, information about an archival service with 
which a document has been stored, etc. Other information useful or
necessary to identify, locate, and maintain documents stored using 
embodiments of the present invention may also be provided. 
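
For explanatory purposes, one record of document management database 600 might be represented as in the following Python sketch. The class name, attribute names, default values, and sample data are assumptions introduced here; only the reference numerals in the comments are taken from the description above.

```python
from dataclasses import dataclass, field


@dataclass
class DocumentRecord:
    """Hypothetical in-memory form of one record of database 600."""
    record_identifier: str                            # record identifier 602
    collection: str                                   # e.g., "CONTRACT"
    file_name: str                                    # file name field 606
    primary_subcategory: str = ""                     # primary subcategory 608
    secondary_subcategory: str = ""                   # secondary subcategory 610
    tertiary_subcategory: str = ""                    # tertiary subcategory 612
    location: dict = field(default_factory=dict)      # location information 614
    creation: dict = field(default_factory=dict)      # creation information 616
    media_types: list = field(default_factory=list)   # media type 618
    archive: dict = field(default_factory=dict)       # archive information 620
    status: str = "pending"                           # assumed status flag


# Illustrative values only.
example = DocumentRecord(
    record_identifier="R-000123",
    collection="CONTRACT",
    file_name="Acme Corp",
    primary_subcategory="CONFIDENTIALITY AGREEMENT",
    location={"region": "Northeast", "department": "Legal", "box": "B-17"},
    creation={"author": "J. Smith", "date_opened": "2001-03-15"},
    media_types=["hard copy", "image"],
    archive={"retention_years": 7},
)
```

Grouping location, creation, and archive details into separate mappings mirrors the way the description treats them as distinct kinds of information, although a relational schema with separate columns would serve equally well.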

Process Description 

Reference is now made to FIG. 7, where a flow chart 700 is shown 
which represents the operation of an embodiment of the present invention. 
The particular arrangement of elements in the flow chart of FIG. 7 is not 
meant to imply a fixed order to the steps; embodiments of the present 
invention can be practiced in any order that is practicable.

Flow chart 700 depicts a process for generating a document 
management database record (e.g., for storage in document management 
database 600 of FIG. 6). Flow chart 700 may be a process conducted
between a user operating client device 10 and controller 400. The process
begins at 702 where controller 400 receives a request to add a document to 
document management database 600. This request may be initiated by a 
user operating client device 10 or it may be initiated by a satellite system such 
as a document generation system 30 (FIG. 3). In some embodiments, the
user operating client device 10 is a records management employee who is
responding to a submission from a document owner or other individual. 
In one embodiment, processing at 702 may involve a user pointing a
browser of client device 10 to a network address of controller 400 to initiate a 
submission. Processing at 702 may involve the user submitting data 
indicating a request to add a document. The data may include data 
identifying a collection to which the document is to be added.

Processing continues at 704 where a determination is made whether a 
preexisting record exists for the document. This may entail, for example, 
prompting the user for further information about the document to enable a full
search of the database to determine if the document has been previously
stored in the database. If processing at 704 indicates that the document has 
previously had a record established, processing may continue to 706 where a 
determination is made whether the record requires an amendment. For 
example, in some situations, a document record may have been created
which is now out of date (e.g., as the result of amendments or updates to the
document, etc.). If no amendments are required, processing continues at 710 
and the existing record is maintained without modification. 

If an amendment is required, or if no preexisting document record is 
located within the system, processing continues to 712 where a template is
selected to guide user data entry. In some embodiments of the present 
invention, different templates gathering different types of information may be 
selected based on the type of document involved. For example, a different 
template may be used to prompt user input about a document which is going 
to be categorized in the collection titled "CONTRACT - CONFIDENTIALITY
AGREEMENT" than for a document to be placed in the collection titled 
"CONTRACT - SOFT DOLLAR AGREEMENT". In some embodiments, 
selection of the appropriate template may involve simply prompting the user to 
indicate the collection in which the document is to be placed. 
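
The branching at 704 and 706 and the template selection at 712 can be sketched as follows. This Python fragment is offered only as an illustration: the function names and the field lists attached to each collection are hypothetical, and only the collection titles and step numerals come from the description.

```python
# Hypothetical templates: each collection maps to the fields its template requests.
TEMPLATES = {
    "CONTRACT - CONFIDENTIALITY AGREEMENT": [
        "file_name", "contracting_party", "effective_date", "term",
    ],
    "CONTRACT - SOFT DOLLAR AGREEMENT": [
        "file_name", "contracting_party", "broker", "commission_rate",
    ],
}
DEFAULT_TEMPLATE = ["file_name"]


def next_step(record_exists: bool, amendment_required: bool) -> int:
    """Return the step of flow chart 700 that follows the checks at 704 and 706."""
    if record_exists and not amendment_required:
        return 710  # existing record is maintained without modification
    return 712      # a template is selected to guide user data entry


def select_template(collection: str) -> list:
    """Pick the template for a collection, falling back to a minimal default."""
    return TEMPLATES.get(collection, DEFAULT_TEMPLATE)
```

Keying the templates by collection title reflects the embodiment in which the user is simply prompted for the collection in which the document is to be placed.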

Processing continues at 714 where document data is received 
pursuant to the selected template. Once a template has been selected at 
712, it is presented to a user to guide the user in entering information about
the document. An example template is shown in FIG. 9, which will be
discussed further below. A user operating client device 10 may enter 
document data requested by the selected template to provide detailed 
information about the document. Upon completion of the template, the user
may submit the information by following instructions on the template (e.g., by 
selecting a "Submit" option on a menu or the like). 

Processing continues at 716 where the user may link the document
information provided at 714 with one or more document images (if any). In
some embodiments, document images may be linked by simply associating
the document images with record identifier 602 (FIG. 6) of the document
management database. In some embodiments, document images may be
generated in a separate process (e.g., after process 700 has been performed
and a record identifier has been established for the document) and the
images are associated with the document record created by process 700 by
associating the images with the appropriate record identifier 602. In other
embodiments, processing at 716 may involve inputting information identifying
a location of pre-generated document images. In either event, an association
between the document record and the document images is established such
that a user may readily locate both. In some embodiments, processing at 716
may also include linking the document record with a text file of the record 
(which may be generated, e.g., using OCR software or the like). 

Processing continues at 718 where a pending document record is
generated which includes the information received at 714 and images (if any)
linked at 716. According to some embodiments of the present invention, only 
active document records are available for searching and retrieval by users 
operating client devices 10. Prior to becoming an active document record, a
pending record is generated and document data contained therein is verified.
In some embodiments, the verification of data at 718 may include review by 
one or more records administrators responsible for maintaining the 
consistency and accuracy of data stored in the system. In some
embodiments, the verification of data at 718 may include automated analysis 
of the record to detect errors or omitted items. For example, certain templates 
may have required fields that must contain data. If data is not contained in 
the fields, processing at 720 will indicate that the submitted data is missing
information. If processing at 720 indicates that data of the pending record is 
not verified, a user may be prompted to reenter the information. In some 
embodiments, a records administrator may be able to override a negative 
determination at 720. 
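
The automated check for required fields, together with the administrator override, might be sketched as shown below. The function names, field names, and sample data are assumptions made for illustration and are not drawn from the described embodiments.

```python
def missing_required_fields(data: dict, required: list) -> list:
    """Return the required field names that are absent or blank in `data`."""
    missing = []
    for name in required:
        value = data.get(name)
        if value is None or not str(value).strip():
            missing.append(name)
    return missing


def is_verified(data: dict, required: list, administrator_override: bool = False) -> bool:
    """Mimic the determination at 720, including the optional override."""
    return administrator_override or not missing_required_fields(data, required)


# Illustrative submission in which one required field contains only whitespace.
submitted = {"file_name": "Acme Corp", "contracting_party": "  "}
required = ["file_name", "contracting_party"]
```

Returning the list of missing field names, rather than a bare pass or fail result, would allow the user to be prompted to reenter only the information that is actually absent.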

Once the data in the pending record have been verified, processing 
continues at 722 where a live or active record is generated. In some 
embodiments, this may simply entail switching a flag in the record to indicate 
that the record has been approved and is now an active record. In other 
embodiments, processing at 722 may entail moving the data from a pending
database to an active database. Once the active record has been generated 
at 722, other users of the system may search and retrieve information 
contained in the new document record. 
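
A minimal sketch of the flag-switching embodiment follows, in which searches return only active records. The dictionary layout, the "status" key, and the function names are hypothetical choices made for this illustration.

```python
# Illustrative records; "status" plays the role of the flag switched at 722.
records = [
    {"record_identifier": "R-1", "file_name": "Acme Corp", "status": "pending"},
    {"record_identifier": "R-2", "file_name": "Acme Holdings", "status": "active"},
]


def activate(record: dict) -> None:
    """Switch the flag so that the record becomes an active record."""
    record["status"] = "active"


def search(all_records: list, term: str) -> list:
    """Return active records whose file name contains `term` (case-insensitive)."""
    needle = term.lower()
    return [
        r for r in all_records
        if r["status"] == "active" and needle in r["file_name"].lower()
    ]
```

Under these assumptions, a search for "acme" initially finds only record R-2; once activate is applied to record R-1, both records are returned.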



In some embodiments, processing may continue at 724 where one or
more labels are generated based on information contained in the active
document record. For example, a records administrator or other user 
operating a client device 10 may generate one or more labels to label a 
physical copy of the document or a file containing the physical copy of the
document. A variety of different formats and types of labels may be
generated, depending on the location of the user and the location of the file.
In some embodiments, labels will automatically be generated for new 
documents based on information in document management database 600. 
Label formats may vary based on the location of the physical document (e.g.,
files located in one office may be stored in lateral hanging folders, while files
located in another office may be stored in buff folders; each may require a 
separate label format which is automatically generated based on information 
in database 600). According to some embodiments of the present invention,
system 300 may automatically select the appropriate label format based on 
the location of the document or other information about the document 
contained within document management database 600. 
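
Automatic selection of a label format from the document's location might look like the following Python sketch. The office names, format names, label text layout, and function name are all hypothetical; the description states only that formats may differ by location (e.g., lateral hanging folders in one office and buff folders in another).

```python
# Hypothetical mapping from office to the label format used there.
LABEL_FORMATS_BY_OFFICE = {
    "Office A": "lateral-hanging",
    "Office B": "buff-folder",
}
DEFAULT_LABEL_FORMAT = "generic"


def build_label(record: dict) -> dict:
    """Choose a label format from the record's location and compose label text."""
    office = record.get("location", {}).get("office", "")
    label_format = LABEL_FORMATS_BY_OFFICE.get(office, DEFAULT_LABEL_FORMAT)
    text = f'{record["collection"]} / {record["file_name"]} / {record["record_identifier"]}'
    return {"format": label_format, "text": text}
```

A record with no recognized office falls back to the default format in this sketch, so that a label can still be generated when location information is incomplete.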

Referring now to FIG. 8, a process 800 is shown for generating a 
pending record in document management database 600 from a satellite 
system (such as, for example, document generation system 30 of FIG. 3). 
According to some embodiments of the present invention, the document
imaging and retrieval system may interface with a number of legacy systems
and other satellite systems which are used to create, manage, or otherwise 
manipulate documents. One example of such a satellite system is a document 
generation system such as a contract authoring or generation system. 
According to some embodiments of the present invention, when a document
is generated by document generation system 30, a pending record in
document management database 600 is also generated. As a result, data
entry is reduced, thereby reducing potential errors and costs.

Process 800 begins at 802 where a document is generated or 
otherwise manipulated in a satellite system. In an example where the satellite
system is a contract authoring tool, processing at 802 may involve a user 
operating the contract authoring tool to generate a new contract document. 

Processing continues at 804 where, pursuant to embodiments of the 
present invention, information from the contract authoring tool is used to
generate a pending record identifying the new document. This pending
record may be generated with data used to populate a new record of 
document management database 600. In some embodiments, the pending 
record generated by the satellite system is in a common format (e.g.,
comma-delimited text, text, or the like). In other embodiments, the pending record is
generated in a format used by document management database 600. In 
other embodiments, the pending record is generated in a format of the 
satellite system which is later translated into a format used by document
management database 600. As a result, embodiments of the present 
invention may receive data from a variety of different types of satellite 
systems. By generating pending records with data from these systems, the 
time, expense, and potential errors associated with reentering the data are
avoided. 
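
As one possible illustration of the comma-delimited embodiment, the sketch below translates text received from a satellite system into pending records. The column names, the sample row, and the function name are assumptions; the parsing relies on Python's standard csv module.

```python
import csv
import io


def pending_records_from_csv(text: str) -> list:
    """Translate comma-delimited text from a satellite system into pending records."""
    records = []
    for row in csv.DictReader(io.StringIO(text)):
        # Skip surplus unnamed columns and treat missing values as empty strings.
        record = {
            key.strip(): (value or "").strip()
            for key, value in row.items()
            if key is not None
        }
        record["status"] = "pending"
        records.append(record)
    return records


# Hypothetical export from a contract authoring tool; the quoted value
# contains a comma to show that delimiters inside fields are handled.
sample = (
    "collection,file_name,author\n"
    'CONTRACT - CONFIDENTIALITY AGREEMENT,"Acme, Inc.",J. Smith\n'
)
```

Each resulting record is marked as pending, so it would still pass through verification before becoming an active record.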

Once the pending record is generated at 804, the pending record is 
forwarded to the data storage and retrieval system for entry in document
management database 600. Once the document is finalized, the record may
be transformed into an active record (e.g., as described above in conjunction 
with the description of FIG. 7). Active records may be amended, augmented, 
and otherwise modified by authorized users. For example, an active record 
may be amended to refer to a later-created image file or to indicate updated
archival information or status data.

Referring now to FIG. 9, an example user interface 900 is shown of a
template according to some embodiments of the present invention. User 
interface 900 may be displayed on a display unit of client device 10 to direct a
user to enter particular types of data to create a new document record for
entry in document management database 600. As shown, user interface 900 
includes a plurality of different fields, including fields in areas 902, 904 and 
906. Area 902 includes fields used to prompt a user to enter classification 
and searchable information about the document, including a primary collection
with which the document is to be associated (which may be selected from a
drop-down list of collections); a filename; and one or more categories which 
may be used to identify the document. A user submitting a new document (or 
a records administrator entering information about a new document) may 
interact with user interface 900 and input document information into these
fields to facilitate later retrieval of the document.
User interface 900 also includes an area 904 which includes fields
prompting a user to enter data about the creation of the document (e.g., the
date opened, the attorney name, the VP/analyst name, the region, department
and document origin, etc.) which may be useful in locating and understanding
the document. Area 906 includes fields prompting a user to enter data about
archive information, label information, and location information. Other 
information may also be solicited from an individual entering data regarding a 
document to be entered into the system. 

Other types and configurations of user interfaces may also be used in 
conjunction with embodiments of the present invention to solicit information, to 
facilitate searching, and to manage documents stored within the system as 
well. 

Although the present invention has been described with respect to a 
preferred embodiment thereof, those skilled in the art will note that various 
substitutions may be made to those embodiments described herein without 
departing from the spirit and scope of the present invention. 